Dataset statistics
| Number of variables | 31 |
|---|---|
| Number of observations | 4807996 |
| Missing cells | 13887314 |
| Missing cells (%) | 9.3% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 1.1 GiB |
| Average record size in memory | 248.0 B |
Variable types
| Numeric | 15 |
|---|---|
| Categorical | 16 |
dat_cadastramento_fam has a high cardinality: 4960 distinct values | High cardinality |
dat_alteracao_fam has a high cardinality: 1364 distinct values | High cardinality |
dat_atualizacao_familia has a high cardinality: 1376 distinct values | High cardinality |
nom_estab_assist_saude_fam has a high cardinality: 25584 distinct values | High cardinality |
nom_centro_assist_fam has a high cardinality: 3699 distinct values | High cardinality |
cd_ibge is highly correlated with cod_centro_assist_fam | High correlation |
estrato is highly correlated with id_familia | High correlation |
classf is highly correlated with id_familia | High correlation |
id_familia is highly correlated with estrato and 1 other fields | High correlation |
vlr_renda_media_fam is highly correlated with marc_pbf | High correlation |
cod_local_domic_fam is highly correlated with cod_abaste_agua_domic_fam and 2 other fields | High correlation |
qtd_comodos_domic_fam is highly correlated with qtd_comodos_dormitorio_fam | High correlation |
qtd_comodos_dormitorio_fam is highly correlated with qtd_comodos_domic_fam | High correlation |
cod_agua_canalizada_fam is highly correlated with cod_abaste_agua_domic_fam | High correlation |
cod_abaste_agua_domic_fam is highly correlated with cod_local_domic_fam and 2 other fields | High correlation |
cod_destino_lixo_domic_fam is highly correlated with cod_local_domic_fam and 2 other fields | High correlation |
cod_calcamento_domic_fam is highly correlated with cod_local_domic_fam and 1 other fields | High correlation |
cod_centro_assist_fam is highly correlated with cd_ibge | High correlation |
marc_pbf is highly correlated with vlr_renda_media_fam | High correlation |
cd_ibge is highly correlated with cod_centro_assist_fam | High correlation |
estrato is highly correlated with id_familia | High correlation |
classf is highly correlated with id_familia | High correlation |
id_familia is highly correlated with estrato and 1 other fields | High correlation |
vlr_renda_media_fam is highly correlated with marc_pbf | High correlation |
cod_local_domic_fam is highly correlated with cod_destino_lixo_domic_fam and 1 other fields | High correlation |
qtd_comodos_domic_fam is highly correlated with qtd_comodos_dormitorio_fam | High correlation |
qtd_comodos_dormitorio_fam is highly correlated with qtd_comodos_domic_fam | High correlation |
cod_agua_canalizada_fam is highly correlated with cod_abaste_agua_domic_fam | High correlation |
cod_abaste_agua_domic_fam is highly correlated with cod_agua_canalizada_fam | High correlation |
cod_destino_lixo_domic_fam is highly correlated with cod_local_domic_fam and 1 other fields | High correlation |
cod_calcamento_domic_fam is highly correlated with cod_local_domic_fam and 1 other fields | High correlation |
cod_centro_assist_fam is highly correlated with cd_ibge | High correlation |
marc_pbf is highly correlated with vlr_renda_media_fam | High correlation |
cd_ibge is highly correlated with cod_centro_assist_fam | High correlation |
classf is highly correlated with id_familia | High correlation |
id_familia is highly correlated with classf | High correlation |
vlr_renda_media_fam is highly correlated with marc_pbf | High correlation |
cod_local_domic_fam is highly correlated with cod_abaste_agua_domic_fam and 2 other fields | High correlation |
qtd_comodos_domic_fam is highly correlated with qtd_comodos_dormitorio_fam | High correlation |
qtd_comodos_dormitorio_fam is highly correlated with qtd_comodos_domic_fam | High correlation |
cod_agua_canalizada_fam is highly correlated with cod_abaste_agua_domic_fam | High correlation |
cod_abaste_agua_domic_fam is highly correlated with cod_local_domic_fam and 1 other fields | High correlation |
cod_destino_lixo_domic_fam is highly correlated with cod_local_domic_fam | High correlation |
cod_calcamento_domic_fam is highly correlated with cod_local_domic_fam | High correlation |
cod_centro_assist_fam is highly correlated with cd_ibge | High correlation |
marc_pbf is highly correlated with vlr_renda_media_fam | High correlation |
cod_local_domic_fam is highly correlated with cod_abaste_agua_domic_fam and 1 other fields | High correlation |
cod_abaste_agua_domic_fam is highly correlated with cod_local_domic_fam and 2 other fields | High correlation |
cod_banheiro_domic_fam is highly correlated with cod_especie_domic_fam | High correlation |
ind_familia_quilombola_fam is highly correlated with cod_familia_indigena_fam | High correlation |
cod_especie_domic_fam is highly correlated with cod_abaste_agua_domic_fam and 3 other fields | High correlation |
cod_familia_indigena_fam is highly correlated with ind_familia_quilombola_fam | High correlation |
cod_agua_canalizada_fam is highly correlated with cod_abaste_agua_domic_fam and 1 other fields | High correlation |
cod_calcamento_domic_fam is highly correlated with cod_local_domic_fam and 1 other fields | High correlation |
cd_ibge is highly correlated with id_familia and 2 other fields | High correlation |
estrato is highly correlated with id_familia | High correlation |
classf is highly correlated with id_familia | High correlation |
id_familia is highly correlated with cd_ibge and 3 other fields | High correlation |
vlr_renda_media_fam is highly correlated with marc_pbf | High correlation |
cod_local_domic_fam is highly correlated with cod_agua_canalizada_fam and 4 other fields | High correlation |
qtd_comodos_domic_fam is highly correlated with qtd_comodos_dormitorio_fam | High correlation |
qtd_comodos_dormitorio_fam is highly correlated with qtd_comodos_domic_fam | High correlation |
cod_material_piso_fam is highly correlated with cod_material_domic_fam | High correlation |
cod_material_domic_fam is highly correlated with cd_ibge and 2 other fields | High correlation |
cod_agua_canalizada_fam is highly correlated with cod_local_domic_fam and 3 other fields | High correlation |
cod_abaste_agua_domic_fam is highly correlated with cod_local_domic_fam and 1 other fields | High correlation |
cod_banheiro_domic_fam is highly correlated with cod_agua_canalizada_fam and 1 other fields | High correlation |
cod_escoa_sanitario_domic_fam is highly correlated with cod_local_domic_fam and 1 other fields | High correlation |
cod_destino_lixo_domic_fam is highly correlated with cod_local_domic_fam and 3 other fields | High correlation |
cod_calcamento_domic_fam is highly correlated with cod_escoa_sanitario_domic_fam and 1 other fields | High correlation |
cod_centro_assist_fam is highly correlated with cd_ibge and 2 other fields | High correlation |
ind_parc_mds_fam is highly correlated with cod_local_domic_fam | High correlation |
marc_pbf is highly correlated with vlr_renda_media_fam | High correlation |
qtd_comodos_domic_fam has 224026 (4.7%) missing values | Missing |
qtd_comodos_dormitorio_fam has 222998 (4.6%) missing values | Missing |
cod_material_piso_fam has 222349 (4.6%) missing values | Missing |
cod_material_domic_fam has 222349 (4.6%) missing values | Missing |
cod_agua_canalizada_fam has 222349 (4.6%) missing values | Missing |
cod_abaste_agua_domic_fam has 222349 (4.6%) missing values | Missing |
cod_banheiro_domic_fam has 222349 (4.6%) missing values | Missing |
cod_escoa_sanitario_domic_fam has 494595 (10.3%) missing values | Missing |
cod_destino_lixo_domic_fam has 222349 (4.6%) missing values | Missing |
cod_iluminacao_domic_fam has 222349 (4.6%) missing values | Missing |
cod_calcamento_domic_fam has 222350 (4.6%) missing values | Missing |
nom_estab_assist_saude_fam has 2441370 (50.8%) missing values | Missing |
cod_eas_fam has 2441370 (50.8%) missing values | Missing |
nom_centro_assist_fam has 3031030 (63.0%) missing values | Missing |
cod_centro_assist_fam has 3031030 (63.0%) missing values | Missing |
ind_parc_mds_fam has 155334 (3.2%) missing values | Missing |
id_familia has unique values | Unique |
vlr_renda_media_fam has 494189 (10.3%) zeros | Zeros |
ind_parc_mds_fam has 4244511 (88.3%) zeros | Zeros |
Reproduction
| Analysis started | 2022-08-24 23:16:32.974227 |
|---|---|
| Analysis finished | 2022-08-24 23:34:01.508735 |
| Duration | 17 minutes and 28.53 seconds |
| Software version | pandas-profiling v3.2.0 |
| Download configuration | config.json |
| Distinct | 5534 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2993741.547 |
| Minimum | 1100015 |
|---|---|
| Maximum | 5300108 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 36.7 MiB |
Quantile statistics
| Minimum | 1100015 |
|---|---|
| 5-th percentile | 1501303 |
| Q1 | 2311875.25 |
| median | 2927002 |
| Q3 | 3526209 |
| 95-th percentile | 5101803 |
| Maximum | 5300108 |
| Range | 4200093 |
| Interquartile range (IQR) | 1214333.75 |
Descriptive statistics
| Standard deviation | 937280.9281 |
|---|---|
| Coefficient of variation (CV) | 0.3130801085 |
| Kurtosis | 0.1068486716 |
| Mean | 2993741.547 |
| Median Absolute Deviation (MAD) | 604801 |
| Skewness | 0.393206125 |
| Sum | 1.439389738 × 1013 |
| Variance | 8.784955382 × 1011 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3550308 | 233230 | 4.9% |
| 3304557 | 98360 | 2.0% |
| 2304400 | 73396 | 1.5% |
| 2927408 | 60027 | 1.2% |
| 1302603 | 45467 | 0.9% |
| 1501402 | 40950 | 0.9% |
| 2611606 | 38405 | 0.8% |
| 2111300 | 35663 | 0.7% |
| 3106200 | 29279 | 0.6% |
| 5300108 | 28538 | 0.6% |
| Other values (5524) | 4124681 |
| Value | Count | Frequency (%) |
| 1100015 | 361 | < 0.1% |
| 1100023 | 2753 | |
| 1100031 | 55 | < 0.1% |
| 1100049 | 2259 | |
| 1100056 | 271 | < 0.1% |
| 1100064 | 181 | < 0.1% |
| 1100072 | 111 | < 0.1% |
| 1100080 | 287 | < 0.1% |
| 1100098 | 435 | < 0.1% |
| 1100106 | 1471 |
| Value | Count | Frequency (%) |
| 5300108 | 28538 | |
| 5222302 | 153 | < 0.1% |
| 5222203 | 50 | < 0.1% |
| 5222054 | 169 | < 0.1% |
| 5222005 | 175 | < 0.1% |
| 5221908 | 108 | < 0.1% |
| 5221858 | 3275 | 0.1% |
| 5221809 | 49 | < 0.1% |
| 5221700 | 295 | < 0.1% |
| 5221601 | 1418 | < 0.1% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 36.7 MiB |
| 2 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 4807996 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 2 |
| 3rd row | 2 |
| 4th row | 2 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
| 2 | 4085981 | |
| 1 | 722015 | 15.0% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 2 | 4085981 | |
| 1 | 722015 | 15.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 4085981 | |
| 1 | 722015 | 15.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4807996 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 4085981 | |
| 1 | 722015 | 15.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4807996 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 4085981 | |
| 1 | 722015 | 15.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4807996 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 4085981 | |
| 1 | 722015 | 15.0% |
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 36.7 MiB |
| 3 | |
|---|---|
| 2 | |
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 4807996 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 2 |
| 3rd row | 2 |
| 4th row | 2 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
| 3 | 2856584 | |
| 2 | 1023594 | 21.3% |
| 1 | 927818 | 19.3% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 3 | 2856584 | |
| 2 | 1023594 | 21.3% |
| 1 | 927818 | 19.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 2856584 | |
| 2 | 1023594 | 21.3% |
| 1 | 927818 | 19.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4807996 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 2856584 | |
| 2 | 1023594 | 21.3% |
| 1 | 927818 | 19.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4807996 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 2856584 | |
| 2 | 1023594 | 21.3% |
| 1 | 927818 | 19.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4807996 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 2856584 | |
| 2 | 1023594 | 21.3% |
| 1 | 927818 | 19.3% |
id_familia
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONUNIQUE| Distinct | 4807996 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2709538.437 |
| Minimum | 1 |
|---|---|
| Maximum | 5290701 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 36.7 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 284955.75 |
| Q1 | 1393348.75 |
| median | 2739219.5 |
| Q3 | 4049591.25 |
| 95-th percentile | 5044434.25 |
| Maximum | 5290701 |
| Range | 5290700 |
| Interquartile range (IQR) | 2656242.5 |
Descriptive statistics
| Standard deviation | 1530538.528 |
|---|---|
| Coefficient of variation (CV) | 0.5648705724 |
| Kurtosis | -1.204976871 |
| Mean | 2709538.437 |
| Median Absolute Deviation (MAD) | 1327558 |
| Skewness | -0.04921304009 |
| Sum | 1.302744997 × 1013 |
| Variance | 2.342548186 × 1012 |
| Monotonicity | Strictly increasing |
| Value | Count | Frequency (%) |
| 1 | 1 | < 0.1% |
| 3618383 | 1 | < 0.1% |
| 3618400 | 1 | < 0.1% |
| 3618399 | 1 | < 0.1% |
| 3618398 | 1 | < 0.1% |
| 3618397 | 1 | < 0.1% |
| 3618396 | 1 | < 0.1% |
| 3618395 | 1 | < 0.1% |
| 3618394 | 1 | < 0.1% |
| 3618392 | 1 | < 0.1% |
| Other values (4807986) | 4807986 |
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 | |
| 10 | 1 | |
| 11 | 1 | |
| 12 | 1 |
| Value | Count | Frequency (%) |
| 5290701 | 1 | |
| 5290700 | 1 | |
| 5290699 | 1 | |
| 5290698 | 1 | |
| 5290697 | 1 | |
| 5290696 | 1 | |
| 5290695 | 1 | |
| 5290694 | 1 | |
| 5290693 | 1 | |
| 5290692 | 1 |
| Distinct | 4960 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 36.7 MiB |
| 2003-03-13 | 144547 |
|---|---|
| 2002-08-18 | 9396 |
| 2003-08-04 | 8886 |
| 2002-05-22 | 8314 |
| 2002-07-20 | 8105 |
| Other values (4955) |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 48079950 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 524 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 2018-06-28 |
|---|---|
| 2nd row | 2018-08-27 |
| 3rd row | 2018-02-23 |
| 4th row | 2013-12-27 |
| 5th row | 2018-03-26 |
Common Values
| Value | Count | Frequency (%) |
| 2003-03-13 | 144547 | 3.0% |
| 2002-08-18 | 9396 | 0.2% |
| 2003-08-04 | 8886 | 0.2% |
| 2002-05-22 | 8314 | 0.2% |
| 2002-07-20 | 8105 | 0.2% |
| 2006-04-08 | 7738 | 0.2% |
| 2006-04-01 | 7421 | 0.2% |
| 2006-08-19 | 7285 | 0.2% |
| 2002-09-07 | 7157 | 0.1% |
| 2002-07-05 | 6831 | 0.1% |
| Other values (4950) | 4592315 |
Length
| Value | Count | Frequency (%) |
| 2003-03-13 | 144547 | 3.0% |
| 2002-08-18 | 9396 | 0.2% |
| 2003-08-04 | 8886 | 0.2% |
| 2002-05-22 | 8314 | 0.2% |
| 2002-07-20 | 8105 | 0.2% |
| 2006-04-08 | 7738 | 0.2% |
| 2006-04-01 | 7421 | 0.2% |
| 2006-08-19 | 7285 | 0.2% |
| 2002-09-07 | 7157 | 0.1% |
| 2002-07-05 | 6831 | 0.1% |
| Other values (4950) | 4592315 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 12338158 | |
| - | 9615990 | |
| 2 | 8090876 | |
| 1 | 7630928 | |
| 8 | 1850789 | 3.8% |
| 3 | 1832746 | 3.8% |
| 7 | 1682960 | 3.5% |
| 6 | 1468942 | 3.1% |
| 5 | 1280497 | 2.7% |
| 4 | 1250563 | 2.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 38463960 | |
| Dash Punctuation | 9615990 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 12338158 | |
| 2 | 8090876 | |
| 1 | 7630928 | |
| 8 | 1850789 | 4.8% |
| 3 | 1832746 | 4.8% |
| 7 | 1682960 | 4.4% |
| 6 | 1468942 | 3.8% |
| 5 | 1280497 | 3.3% |
| 4 | 1250563 | 3.3% |
| 9 | 1037501 | 2.7% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 9615990 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 48079950 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 12338158 | |
| - | 9615990 | |
| 2 | 8090876 | |
| 1 | 7630928 | |
| 8 | 1850789 | 3.8% |
| 3 | 1832746 | 3.8% |
| 7 | 1682960 | 3.5% |
| 6 | 1468942 | 3.1% |
| 5 | 1280497 | 2.7% |
| 4 | 1250563 | 2.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 48079950 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 12338158 | |
| - | 9615990 | |
| 2 | 8090876 | |
| 1 | 7630928 | |
| 8 | 1850789 | 3.8% |
| 3 | 1832746 | 3.8% |
| 7 | 1682960 | 3.5% |
| 6 | 1468942 | 3.1% |
| 5 | 1280497 | 2.7% |
| 4 | 1250563 | 2.6% |
| Distinct | 1364 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 36.7 MiB |
| 2018-10-01 | |
|---|---|
| 2018-09-30 | |
| 2018-10-02 | |
| 2018-09-25 | 21591 |
| 2018-09-27 | 19269 |
| Other values (1359) |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 48079960 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 62 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 2018-10-02 |
|---|---|
| 2nd row | 2018-11-29 |
| 3rd row | 2018-02-27 |
| 4th row | 2018-10-01 |
| 5th row | 2018-03-28 |
Common Values
| Value | Count | Frequency (%) |
| 2018-10-01 | 1654010 | |
| 2018-09-30 | 1433585 | |
| 2018-10-02 | 218865 | 4.6% |
| 2018-09-25 | 21591 | 0.4% |
| 2018-09-27 | 19269 | 0.4% |
| 2018-11-13 | 16625 | 0.3% |
| 2018-11-27 | 16263 | 0.3% |
| 2018-11-28 | 16141 | 0.3% |
| 2018-12-11 | 16012 | 0.3% |
| 2018-12-04 | 15902 | 0.3% |
| Other values (1354) | 1379733 |
Length
| Value | Count | Frequency (%) |
| 2018-10-01 | 1654010 | |
| 2018-09-30 | 1433585 | |
| 2018-10-02 | 218865 | 4.6% |
| 2018-09-25 | 21591 | 0.4% |
| 2018-09-27 | 19269 | 0.4% |
| 2018-11-13 | 16625 | 0.3% |
| 2018-11-27 | 16263 | 0.3% |
| 2018-11-28 | 16141 | 0.3% |
| 2018-12-11 | 16012 | 0.3% |
| 2018-12-04 | 15902 | 0.3% |
| Other values (1354) | 1379733 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 12944259 | |
| 1 | 10110933 | |
| - | 9615992 | |
| 2 | 5981452 | |
| 8 | 4521601 | 9.4% |
| 9 | 1755763 | 3.7% |
| 3 | 1751997 | 3.6% |
| 7 | 431774 | 0.9% |
| 6 | 385296 | 0.8% |
| 5 | 352701 | 0.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 38463968 | |
| Dash Punctuation | 9615992 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 12944259 | |
| 1 | 10110933 | |
| 2 | 5981452 | |
| 8 | 4521601 | 11.8% |
| 9 | 1755763 | 4.6% |
| 3 | 1751997 | 4.6% |
| 7 | 431774 | 1.1% |
| 6 | 385296 | 1.0% |
| 5 | 352701 | 0.9% |
| 4 | 228192 | 0.6% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 9615992 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 48079960 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 12944259 | |
| 1 | 10110933 | |
| - | 9615992 | |
| 2 | 5981452 | |
| 8 | 4521601 | 9.4% |
| 9 | 1755763 | 3.7% |
| 3 | 1751997 | 3.6% |
| 7 | 431774 | 0.9% |
| 6 | 385296 | 0.8% |
| 5 | 352701 | 0.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 48079960 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 12944259 | |
| 1 | 10110933 | |
| - | 9615992 | |
| 2 | 5981452 | |
| 8 | 4521601 | 9.4% |
| 9 | 1755763 | 3.7% |
| 3 | 1751997 | 3.6% |
| 7 | 431774 | 0.9% |
| 6 | 385296 | 0.8% |
| 5 | 352701 | 0.7% |
vlr_renda_media_fam
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONZEROS| Distinct | 2806 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 138 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 279.7731331 |
| Minimum | 0 |
|---|---|
| Maximum | 2862 |
| Zeros | 494189 |
| Zeros (%) | 10.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 36.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 33 |
| median | 100 |
| Q3 | 440 |
| 95-th percentile | 954 |
| Maximum | 2862 |
| Range | 2862 |
| Interquartile range (IQR) | 407 |
Descriptive statistics
| Standard deviation | 350.8036979 |
|---|---|
| Coefficient of variation (CV) | 1.253886297 |
| Kurtosis | 3.140338859 |
| Mean | 279.7731331 |
| Median Absolute Deviation (MAD) | 100 |
| Skewness | 1.685596758 |
| Sum | 1345109496 |
| Variance | 123063.2345 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 494189 | 10.3% |
| 954 | 212385 | 4.4% |
| 50 | 207430 | 4.3% |
| 937 | 200318 | 4.2% |
| 100 | 126628 | 2.6% |
| 75 | 112083 | 2.3% |
| 477 | 103440 | 2.2% |
| 66 | 91883 | 1.9% |
| 25 | 82715 | 1.7% |
| 33 | 78055 | 1.6% |
| Other values (2796) | 3098732 |
| Value | Count | Frequency (%) |
| 0 | 494189 | |
| 1 | 12608 | 0.3% |
| 2 | 21656 | 0.5% |
| 3 | 15449 | 0.3% |
| 4 | 23636 | 0.5% |
| 5 | 26443 | 0.5% |
| 6 | 25292 | 0.5% |
| 7 | 9765 | 0.2% |
| 8 | 38369 | 0.8% |
| 9 | 5884 | 0.1% |
| Value | Count | Frequency (%) |
| 2862 | 61 | |
| 2861 | 2 | < 0.1% |
| 2860 | 5 | < 0.1% |
| 2859 | 4 | < 0.1% |
| 2858 | 1 | < 0.1% |
| 2857 | 2 | < 0.1% |
| 2855 | 1 | < 0.1% |
| 2854 | 4 | < 0.1% |
| 2853 | 1 | < 0.1% |
| 2852 | 1 | < 0.1% |
| Distinct | 1376 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 36.7 MiB |
| 2018-09-13 | 17127 |
|---|---|
| 2018-09-11 | 16350 |
| 2018-09-12 | 16257 |
| 2018-11-13 | 15519 |
| 2018-09-04 | 15451 |
| Other values (1371) |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 48079960 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 32 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 2018-06-28 |
|---|---|
| 2nd row | 2018-11-29 |
| 3rd row | 2018-02-23 |
| 4th row | 2017-06-22 |
| 5th row | 2018-03-26 |
Common Values
| Value | Count | Frequency (%) |
| 2018-09-13 | 17127 | 0.4% |
| 2018-09-11 | 16350 | 0.3% |
| 2018-09-12 | 16257 | 0.3% |
| 2018-11-13 | 15519 | 0.3% |
| 2018-09-04 | 15451 | 0.3% |
| 2018-11-28 | 14942 | 0.3% |
| 2018-09-10 | 14898 | 0.3% |
| 2018-08-07 | 14884 | 0.3% |
| 2018-11-12 | 14855 | 0.3% |
| 2018-11-21 | 14852 | 0.3% |
| Other values (1366) | 4652861 |
Length
| Value | Count | Frequency (%) |
| 2018-09-13 | 17127 | 0.4% |
| 2018-09-11 | 16350 | 0.3% |
| 2018-09-12 | 16257 | 0.3% |
| 2018-11-13 | 15519 | 0.3% |
| 2018-09-04 | 15451 | 0.3% |
| 2018-11-28 | 14942 | 0.3% |
| 2018-09-10 | 14898 | 0.3% |
| 2018-08-07 | 14884 | 0.3% |
| 2018-11-12 | 14855 | 0.3% |
| 2018-11-21 | 14852 | 0.3% |
| Other values (1366) | 4652861 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 10567252 | |
| - | 9615992 | |
| 1 | 9105399 | |
| 2 | 7526830 | |
| 8 | 3524905 | 7.3% |
| 7 | 2357540 | 4.9% |
| 6 | 1445419 | 3.0% |
| 3 | 1097880 | 2.3% |
| 5 | 1058624 | 2.2% |
| 9 | 932588 | 1.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 38463968 | |
| Dash Punctuation | 9615992 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 10567252 | |
| 1 | 9105399 | |
| 2 | 7526830 | |
| 8 | 3524905 | 9.2% |
| 7 | 2357540 | 6.1% |
| 6 | 1445419 | 3.8% |
| 3 | 1097880 | 2.9% |
| 5 | 1058624 | 2.8% |
| 9 | 932588 | 2.4% |
| 4 | 847531 | 2.2% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 9615992 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 48079960 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 10567252 | |
| - | 9615992 | |
| 1 | 9105399 | |
| 2 | 7526830 | |
| 8 | 3524905 | 7.3% |
| 7 | 2357540 | 4.9% |
| 6 | 1445419 | 3.0% |
| 3 | 1097880 | 2.3% |
| 5 | 1058624 | 2.2% |
| 9 | 932588 | 1.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 48079960 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 10567252 | |
| - | 9615992 | |
| 1 | 9105399 | |
| 2 | 7526830 | |
| 8 | 3524905 | 7.3% |
| 7 | 2357540 | 4.9% |
| 6 | 1445419 | 3.0% |
| 3 | 1097880 | 2.3% |
| 5 | 1058624 | 2.2% |
| 9 | 932588 | 1.9% |
cod_local_domic_fam
Categorical
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATION| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 20727 |
| Missing (%) | 0.4% |
| Memory size | 36.7 MiB |
| 1.0 | |
|---|---|
| 2.0 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 14361807 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1.0 |
|---|---|
| 2nd row | 1.0 |
| 3rd row | 1.0 |
| 4th row | 1.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 3852815 | |
| 2.0 | 934454 | 19.4% |
| (Missing) | 20727 | 0.4% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 1.0 | 3852815 | |
| 2.0 | 934454 | 19.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 4787269 | |
| 0 | 4787269 | |
| 1 | 3852815 | |
| 2 | 934454 | 6.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 9574538 | |
| Other Punctuation | 4787269 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 4787269 | |
| 1 | 3852815 | |
| 2 | 934454 | 9.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 4787269 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 14361807 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 4787269 | |
| 0 | 4787269 | |
| 1 | 3852815 | |
| 2 | 934454 | 6.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 14361807 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 4787269 | |
| 0 | 4787269 | |
| 1 | 3852815 | |
| 2 | 934454 | 6.5% |
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 20730 |
| Missing (%) | 0.4% |
| Memory size | 36.7 MiB |
| 1.0 | |
|---|---|
| 2.0 | 161167 |
| 3.0 | 40450 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 14361798 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1.0 |
|---|---|
| 2nd row | 1.0 |
| 3rd row | 1.0 |
| 4th row | 1.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 4585649 | |
| 2.0 | 161167 | 3.4% |
| 3.0 | 40450 | 0.8% |
| (Missing) | 20730 | 0.4% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 1.0 | 4585649 | |
| 2.0 | 161167 | 3.4% |
| 3.0 | 40450 | 0.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 4787266 | |
| 0 | 4787266 | |
| 1 | 4585649 | |
| 2 | 161167 | 1.1% |
| 3 | 40450 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 9574532 | |
| Other Punctuation | 4787266 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 4787266 | |
| 1 | 4585649 | |
| 2 | 161167 | 1.7% |
| 3 | 40450 | 0.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 4787266 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 14361798 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 4787266 | |
| 0 | 4787266 | |
| 1 | 4585649 | |
| 2 | 161167 | 1.1% |
| 3 | 40450 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 14361798 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 4787266 | |
| 0 | 4787266 | |
| 1 | 4585649 | |
| 2 | 161167 | 1.1% |
| 3 | 40450 | 0.3% |
qtd_comodos_domic_fam
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSING| Distinct | 21 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 224026 |
| Missing (%) | 4.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.418609851 |
| Minimum | 0 |
|---|---|
| Maximum | 20 |
| Zeros | 184 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 36.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 4 |
| median | 5 |
| Q3 | 5 |
| 95-th percentile | 7 |
| Maximum | 20 |
| Range | 20 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.380896982 |
|---|---|
| Coefficient of variation (CV) | 0.3125184229 |
| Kurtosis | 1.64972056 |
| Mean | 4.418609851 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.2039207811 |
| Sum | 20254775 |
| Variance | 1.906876475 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5 | 1618757 | |
| 4 | 1114384 | |
| 3 | 700405 | |
| 6 | 525483 | 10.9% |
| 2 | 310040 | 6.4% |
| 7 | 150272 | 3.1% |
| 1 | 83307 | 1.7% |
| 8 | 54874 | 1.1% |
| 9 | 15828 | 0.3% |
| 10 | 6554 | 0.1% |
| Other values (11) | 4066 | 0.1% |
| (Missing) | 224026 | 4.7% |
| Value | Count | Frequency (%) |
| 0 | 184 | < 0.1% |
| 1 | 83307 | 1.7% |
| 2 | 310040 | 6.4% |
| 3 | 700405 | |
| 4 | 1114384 | |
| 5 | 1618757 | |
| 6 | 525483 | 10.9% |
| 7 | 150272 | 3.1% |
| 8 | 54874 | 1.1% |
| 9 | 15828 | 0.3% |
| Value | Count | Frequency (%) |
| 20 | 69 | < 0.1% |
| 19 | 20 | < 0.1% |
| 18 | 34 | < 0.1% |
| 17 | 27 | < 0.1% |
| 16 | 55 | < 0.1% |
| 15 | 107 | < 0.1% |
| 14 | 187 | < 0.1% |
| 13 | 392 | < 0.1% |
| 12 | 1135 | |
| 11 | 1856 |
qtd_comodos_dormitorio_fam
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSING| Distinct | 20 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 222998 |
| Missing (%) | 4.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.775068386 |
| Minimum | 0 |
|---|---|
| Maximum | 20 |
| Zeros | 1999 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 36.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 2 |
| Q3 | 2 |
| 95-th percentile | 3 |
| Maximum | 20 |
| Range | 20 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.7538018879 |
|---|---|
| Coefficient of variation (CV) | 0.4246607589 |
| Kurtosis | 17.08510656 |
| Mean | 1.775068386 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.569846723 |
| Sum | 8138685 |
| Variance | 0.5682172861 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 2175147 | |
| 1 | 1766770 | |
| 3 | 566551 | 11.8% |
| 4 | 60978 | 1.3% |
| 5 | 9646 | 0.2% |
| 6 | 2382 | < 0.1% |
| 0 | 1999 | < 0.1% |
| 7 | 553 | < 0.1% |
| 8 | 240 | < 0.1% |
| 12 | 162 | < 0.1% |
| Other values (10) | 570 | < 0.1% |
| (Missing) | 222998 | 4.6% |
| Value | Count | Frequency (%) |
| 0 | 1999 | < 0.1% |
| 1 | 1766770 | |
| 2 | 2175147 | |
| 3 | 566551 | 11.8% |
| 4 | 60978 | 1.3% |
| 5 | 9646 | 0.2% |
| 6 | 2382 | < 0.1% |
| 7 | 553 | < 0.1% |
| 8 | 240 | < 0.1% |
| 9 | 70 | < 0.1% |
| Value | Count | Frequency (%) |
| 20 | 157 | |
| 18 | 3 | < 0.1% |
| 17 | 5 | < 0.1% |
| 16 | 1 | < 0.1% |
| 15 | 47 | < 0.1% |
| 14 | 43 | < 0.1% |
| 13 | 22 | < 0.1% |
| 12 | 162 | |
| 11 | 61 | < 0.1% |
| 10 | 161 |
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 222349 |
| Missing (%) | 4.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.586297637 |
| Minimum | 1 |
|---|---|
| Maximum | 7 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 36.7 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 2 |
| median | 5 |
| Q3 | 5 |
| 95-th percentile | 5 |
| Maximum | 7 |
| Range | 6 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 1.537008739 |
|---|---|
| Coefficient of variation (CV) | 0.4285781312 |
| Kurtosis | -1.715803855 |
| Mean | 3.586297637 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -0.1741459638 |
| Sum | 16445495 |
| Variance | 2.362395864 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5 | 2302685 | |
| 2 | 1829595 | |
| 1 | 185782 | 3.9% |
| 4 | 167288 | 3.5% |
| 3 | 70236 | 1.5% |
| 7 | 26872 | 0.6% |
| 6 | 3189 | 0.1% |
| (Missing) | 222349 | 4.6% |
| Value | Count | Frequency (%) |
| 1 | 185782 | 3.9% |
| 2 | 1829595 | |
| 3 | 70236 | 1.5% |
| 4 | 167288 | 3.5% |
| 5 | 2302685 | |
| 6 | 3189 | 0.1% |
| 7 | 26872 | 0.6% |
| Value | Count | Frequency (%) |
| 7 | 26872 | 0.6% |
| 6 | 3189 | 0.1% |
| 5 | 2302685 | |
| 4 | 167288 | 3.5% |
| 3 | 70236 | 1.5% |
| 2 | 1829595 | |
| 1 | 185782 | 3.9% |
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 222349 |
| Missing (%) | 4.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.540680955 |
| Minimum | 1 |
|---|---|
| Maximum | 8 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 36.7 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 4 |
| Maximum | 8 |
| Range | 7 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.233464673 |
|---|---|
| Coefficient of variation (CV) | 0.8005970794 |
| Kurtosis | 11.39308843 |
| Mean | 1.540680955 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.219770058 |
| Sum | 7065019 |
| Variance | 1.5214351 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 3365529 | |
| 2 | 690555 | 14.4% |
| 3 | 274208 | 5.7% |
| 6 | 76891 | 1.6% |
| 5 | 62766 | 1.3% |
| 8 | 60505 | 1.3% |
| 4 | 49937 | 1.0% |
| 7 | 5256 | 0.1% |
| (Missing) | 222349 | 4.6% |
| Value | Count | Frequency (%) |
| 1 | 3365529 | |
| 2 | 690555 | 14.4% |
| 3 | 274208 | 5.7% |
| 4 | 49937 | 1.0% |
| 5 | 62766 | 1.3% |
| 6 | 76891 | 1.6% |
| 7 | 5256 | 0.1% |
| 8 | 60505 | 1.3% |
| Value | Count | Frequency (%) |
| 8 | 60505 | 1.3% |
| 7 | 5256 | 0.1% |
| 6 | 76891 | 1.6% |
| 5 | 62766 | 1.3% |
| 4 | 49937 | 1.0% |
| 3 | 274208 | 5.7% |
| 2 | 690555 | 14.4% |
| 1 | 3365529 |
cod_agua_canalizada_fam
Categorical
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSING| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 222349 |
| Missing (%) | 4.6% |
| Memory size | 36.7 MiB |
| 1.0 | |
|---|---|
| 2.0 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 13756941 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1.0 |
|---|---|
| 2nd row | 1.0 |
| 3rd row | 1.0 |
| 4th row | 1.0 |
| 5th row | 2.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 4023381 | |
| 2.0 | 562266 | 11.7% |
| (Missing) | 222349 | 4.6% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 1.0 | 4023381 | |
| 2.0 | 562266 | 12.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 4585647 | |
| 0 | 4585647 | |
| 1 | 4023381 | |
| 2 | 562266 | 4.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 9171294 | |
| Other Punctuation | 4585647 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 4585647 | |
| 1 | 4023381 | |
| 2 | 562266 | 6.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 4585647 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 13756941 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 4585647 | |
| 0 | 4585647 | |
| 1 | 4023381 | |
| 2 | 562266 | 4.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 13756941 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 4585647 | |
| 0 | 4585647 | |
| 1 | 4023381 | |
| 2 | 562266 | 4.1% |
cod_abaste_agua_domic_fam
Categorical
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSING| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 222349 |
| Missing (%) | 4.6% |
| Memory size | 36.7 MiB |
| 1.0 | |
|---|---|
| 2.0 | |
| 4.0 | 211057 |
| 3.0 | 143273 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 13756941 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1.0 |
|---|---|
| 2nd row | 1.0 |
| 3rd row | 1.0 |
| 4th row | 1.0 |
| 5th row | 4.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 3537937 | |
| 2.0 | 693380 | 14.4% |
| 4.0 | 211057 | 4.4% |
| 3.0 | 143273 | 3.0% |
| (Missing) | 222349 | 4.6% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 1.0 | 3537937 | |
| 2.0 | 693380 | 15.1% |
| 4.0 | 211057 | 4.6% |
| 3.0 | 143273 | 3.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 4585647 | |
| 0 | 4585647 | |
| 1 | 3537937 | |
| 2 | 693380 | 5.0% |
| 4 | 211057 | 1.5% |
| 3 | 143273 | 1.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 9171294 | |
| Other Punctuation | 4585647 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 4585647 | |
| 1 | 3537937 | |
| 2 | 693380 | 7.6% |
| 4 | 211057 | 2.3% |
| 3 | 143273 | 1.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 4585647 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 13756941 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 4585647 | |
| 0 | 4585647 | |
| 1 | 3537937 | |
| 2 | 693380 | 5.0% |
| 4 | 211057 | 1.5% |
| 3 | 143273 | 1.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 13756941 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 4585647 | |
| 0 | 4585647 | |
| 1 | 3537937 | |
| 2 | 693380 | 5.0% |
| 4 | 211057 | 1.5% |
| 3 | 143273 | 1.0% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 222349 |
| Missing (%) | 4.6% |
| Memory size | 36.7 MiB |
| 1.0 | |
|---|---|
| 2.0 | 272246 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 13756941 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1.0 |
|---|---|
| 2nd row | 1.0 |
| 3rd row | 1.0 |
| 4th row | 1.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 4313401 | |
| 2.0 | 272246 | 5.7% |
| (Missing) | 222349 | 4.6% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 1.0 | 4313401 | |
| 2.0 | 272246 | 5.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 4585647 | |
| 0 | 4585647 | |
| 1 | 4313401 | |
| 2 | 272246 | 2.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 9171294 | |
| Other Punctuation | 4585647 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 4585647 | |
| 1 | 4313401 | |
| 2 | 272246 | 3.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 4585647 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 13756941 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 4585647 | |
| 0 | 4585647 | |
| 1 | 4313401 | |
| 2 | 272246 | 2.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 13756941 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 4585647 | |
| 0 | 4585647 | |
| 1 | 4313401 | |
| 2 | 272246 | 2.0% |
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 494595 |
| Missing (%) | 10.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.864986585 |
| Minimum | 1 |
|---|---|
| Maximum | 6 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 36.7 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 3 |
| 95-th percentile | 3 |
| Maximum | 6 |
| Range | 5 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.04023785 |
|---|---|
| Coefficient of variation (CV) | 0.5577722962 |
| Kurtosis | 0.6721160173 |
| Mean | 1.864986585 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.9832491488 |
| Sum | 8044435 |
| Variance | 1.082094784 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 2257758 | |
| 3 | 1244054 | |
| 2 | 648989 | 13.5% |
| 4 | 88082 | 1.8% |
| 5 | 42899 | 0.9% |
| 6 | 31619 | 0.7% |
| (Missing) | 494595 | 10.3% |
| Value | Count | Frequency (%) |
| 1 | 2257758 | |
| 2 | 648989 | 13.5% |
| 3 | 1244054 | |
| 4 | 88082 | 1.8% |
| 5 | 42899 | 0.9% |
| 6 | 31619 | 0.7% |
| Value | Count | Frequency (%) |
| 6 | 31619 | 0.7% |
| 5 | 42899 | 0.9% |
| 4 | 88082 | 1.8% |
| 3 | 1244054 | |
| 2 | 648989 | 13.5% |
| 1 | 2257758 |
cod_destino_lixo_domic_fam
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSING| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 222349 |
| Missing (%) | 4.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.409208123 |
| Minimum | 1 |
|---|---|
| Maximum | 6 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 36.7 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 3 |
| Maximum | 6 |
| Range | 5 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.8344851491 |
|---|---|
| Coefficient of variation (CV) | 0.5921660085 |
| Kurtosis | 3.966479432 |
| Mean | 1.409208123 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.023718151 |
| Sum | 6462131 |
| Variance | 0.6963654641 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 3575970 | |
| 3 | 665277 | 13.8% |
| 2 | 262543 | 5.5% |
| 4 | 62082 | 1.3% |
| 6 | 18041 | 0.4% |
| 5 | 1734 | < 0.1% |
| (Missing) | 222349 | 4.6% |
| Value | Count | Frequency (%) |
| 1 | 3575970 | |
| 2 | 262543 | 5.5% |
| 3 | 665277 | 13.8% |
| 4 | 62082 | 1.3% |
| 5 | 1734 | < 0.1% |
| 6 | 18041 | 0.4% |
| Value | Count | Frequency (%) |
| 6 | 18041 | 0.4% |
| 5 | 1734 | < 0.1% |
| 4 | 62082 | 1.3% |
| 3 | 665277 | 13.8% |
| 2 | 262543 | 5.5% |
| 1 | 3575970 |
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 222349 |
| Missing (%) | 4.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.317827343 |
| Minimum | 1 |
|---|---|
| Maximum | 6 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 36.7 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 3 |
| Maximum | 6 |
| Range | 5 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.9058275392 |
|---|---|
| Coefficient of variation (CV) | 0.6873643534 |
| Kurtosis | 13.16471092 |
| Mean | 1.317827343 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.515250495 |
| Sum | 6043091 |
| Variance | 0.8205235307 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 3888367 | |
| 2 | 279385 | 5.8% |
| 3 | 271003 | 5.6% |
| 6 | 86391 | 1.8% |
| 4 | 37906 | 0.8% |
| 5 | 22595 | 0.5% |
| (Missing) | 222349 | 4.6% |
| Value | Count | Frequency (%) |
| 1 | 3888367 | |
| 2 | 279385 | 5.8% |
| 3 | 271003 | 5.6% |
| 4 | 37906 | 0.8% |
| 5 | 22595 | 0.5% |
| 6 | 86391 | 1.8% |
| Value | Count | Frequency (%) |
| 6 | 86391 | 1.8% |
| 5 | 22595 | 0.5% |
| 4 | 37906 | 0.8% |
| 3 | 271003 | 5.6% |
| 2 | 279385 | 5.8% |
| 1 | 3888367 |
cod_calcamento_domic_fam
Categorical
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSING| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 222350 |
| Missing (%) | 4.6% |
| Memory size | 36.7 MiB |
| 1.0 | |
|---|---|
| 3.0 | |
| 2.0 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 13756938 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1.0 |
|---|---|
| 2nd row | 1.0 |
| 3rd row | 3.0 |
| 4th row | 3.0 |
| 5th row | 3.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 2720352 | |
| 3.0 | 1579965 | |
| 2.0 | 285329 | 5.9% |
| (Missing) | 222350 | 4.6% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 1.0 | 2720352 | |
| 3.0 | 1579965 | |
| 2.0 | 285329 | 6.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 4585646 | |
| 0 | 4585646 | |
| 1 | 2720352 | |
| 3 | 1579965 | 11.5% |
| 2 | 285329 | 2.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 9171292 | |
| Other Punctuation | 4585646 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 4585646 | |
| 1 | 2720352 | |
| 3 | 1579965 | 17.2% |
| 2 | 285329 | 3.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 4585646 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 13756938 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 4585646 | |
| 0 | 4585646 | |
| 1 | 2720352 | |
| 3 | 1579965 | 11.5% |
| 2 | 285329 | 2.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 13756938 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 4585646 | |
| 0 | 4585646 | |
| 1 | 2720352 | |
| 3 | 1579965 | 11.5% |
| 2 | 285329 | 2.1% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2 |
| Missing (%) | < 0.1% |
| Memory size | 36.7 MiB |
| 2.0 | |
|---|---|
| 1.0 | 25168 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 14423982 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2.0 |
|---|---|
| 2nd row | 2.0 |
| 3rd row | 2.0 |
| 4th row | 2.0 |
| 5th row | 2.0 |
Common Values
| Value | Count | Frequency (%) |
| 2.0 | 4782826 | |
| 1.0 | 25168 | 0.5% |
| (Missing) | 2 | < 0.1% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 2.0 | 4782826 | |
| 1.0 | 25168 | 0.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 4807994 | |
| 0 | 4807994 | |
| 2 | 4782826 | |
| 1 | 25168 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 9615988 | |
| Other Punctuation | 4807994 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 4807994 | |
| 2 | 4782826 | |
| 1 | 25168 | 0.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 4807994 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 14423982 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 4807994 | |
| 0 | 4807994 | |
| 2 | 4782826 | |
| 1 | 25168 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 14423982 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 4807994 | |
| 0 | 4807994 | |
| 2 | 4782826 | |
| 1 | 25168 | 0.2% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 25170 |
| Missing (%) | 0.5% |
| Memory size | 36.7 MiB |
| 2.0 | |
|---|---|
| 1.0 | 29310 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 14348478 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2.0 |
|---|---|
| 2nd row | 2.0 |
| 3rd row | 2.0 |
| 4th row | 2.0 |
| 5th row | 2.0 |
Common Values
| Value | Count | Frequency (%) |
| 2.0 | 4753516 | |
| 1.0 | 29310 | 0.6% |
| (Missing) | 25170 | 0.5% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 2.0 | 4753516 | |
| 1.0 | 29310 | 0.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 4782826 | |
| 0 | 4782826 | |
| 2 | 4753516 | |
| 1 | 29310 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 9565652 | |
| Other Punctuation | 4782826 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 4782826 | |
| 2 | 4753516 | |
| 1 | 29310 | 0.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 4782826 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 14348478 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 4782826 | |
| 0 | 4782826 | |
| 2 | 4753516 | |
| 1 | 29310 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 14348478 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 4782826 | |
| 0 | 4782826 | |
| 2 | 4753516 | |
| 1 | 29310 | 0.2% |
| Distinct | 25584 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 2441370 |
| Missing (%) | 50.8% |
| Memory size | 36.7 MiB |
| CLINICA DA FAMILIA | 19037 |
|---|---|
| POSTO DE COLETA DE CODO I | 4212 |
| HOSPITAL MUNICIPAL JAMEL CECILIO ANAPOLIS | 4014 |
| UNIDADE DE SAUDE FAMILIAR COMUNITARIA | 2451 |
| HOSPITAL MUNICIPAL DE IPIRA | 2044 |
| Other values (25579) |
Length
| Max length | 60 |
|---|---|
| Median length | 44 |
| Mean length | 29.54982874 |
| Min length | 3 |
Characters and Unicode
| Total characters | 69933393 |
|---|---|
| Distinct characters | 37 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1795 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | US CAMPO VERDE |
|---|---|
| 2nd row | UNIDADE REGIONAL DE SAUDE SERRA |
| 3rd row | UNIDADE BASICA DE SAUDE VILA NOVA DE COLARES |
| 4th row | UNIDADE DE SAUDE DA FAMILIA DE ULISSES GUIMARAES |
| 5th row | UNIDADE DE SAUDE DA FAMILIA DE TERRA VERMELHA |
Common Values
| Value | Count | Frequency (%) |
| CLINICA DA FAMILIA | 19037 | 0.4% |
| POSTO DE COLETA DE CODO I | 4212 | 0.1% |
| HOSPITAL MUNICIPAL JAMEL CECILIO ANAPOLIS | 4014 | 0.1% |
| UNIDADE DE SAUDE FAMILIAR COMUNITARIA | 2451 | 0.1% |
| HOSPITAL MUNICIPAL DE IPIRA | 2044 | < 0.1% |
| UBS DE SANTALUZ | 2021 | < 0.1% |
| C S F ARGEU HERBSTER | 1783 | < 0.1% |
| UNIDADE MISTA DE AFUA | 1775 | < 0.1% |
| SECRETARIA MUNICIPAL DA SAUDE DE IJUI VIGILANCIA EM SAUDE | 1773 | < 0.1% |
| CENTRO DE SAUDE SAO FRANCISCO | 1753 | < 0.1% |
| Other values (25574) | 2325763 | |
| (Missing) | 2441370 |
Length
| Value | Count | Frequency (%) |
| de | 1418339 | 11.3% |
| saude | 978408 | 7.8% |
| unidade | 584627 | 4.7% |
| da | 483610 | 3.9% |
| ubs | 438532 | 3.5% |
| familia | 349953 | 2.8% |
| centro | 305587 | 2.4% |
| psf | 209944 | 1.7% |
| usf | 202520 | 1.6% |
| basica | 185989 | 1.5% |
| Other values (11980) | 7349855 |
Most occurring characters
| Value | Count | Frequency (%) |
| 10140738 | ||
| A | 9127347 | |
| E | 6243142 | |
| D | 5530745 | 7.9% |
| I | 5234689 | 7.5% |
| S | 4842292 | 6.9% |
| O | 4121607 | 5.9% |
| U | 3498777 | 5.0% |
| R | 3367949 | 4.8% |
| N | 2997304 | 4.3% |
| Other values (27) | 14828803 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 59470859 | |
| Space Separator | 10140738 | 14.5% |
| Decimal Number | 321796 | 0.5% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 9127347 | |
| E | 6243142 | |
| D | 5530745 | |
| I | 5234689 | |
| S | 4842292 | 8.1% |
| O | 4121607 | 6.9% |
| U | 3498777 | 5.9% |
| R | 3367949 | 5.7% |
| N | 2997304 | 5.0% |
| L | 2139306 | 3.6% |
| Other values (16) | 12367701 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 60180 | |
| 2 | 57606 | |
| 3 | 51712 | |
| 0 | 50853 | |
| 5 | 30567 | |
| 4 | 23747 | 7.4% |
| 7 | 12913 | 4.0% |
| 6 | 12822 | 4.0% |
| 9 | 11044 | 3.4% |
| 8 | 10352 | 3.2% |
Space Separator
| Value | Count | Frequency (%) |
| 10140738 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 59470859 | |
| Common | 10462534 | 15.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 9127347 | |
| E | 6243142 | |
| D | 5530745 | |
| I | 5234689 | |
| S | 4842292 | 8.1% |
| O | 4121607 | 6.9% |
| U | 3498777 | 5.9% |
| R | 3367949 | 5.7% |
| N | 2997304 | 5.0% |
| L | 2139306 | 3.6% |
| Other values (16) | 12367701 |
Common
| Value | Count | Frequency (%) |
| 10140738 | ||
| 1 | 60180 | 0.6% |
| 2 | 57606 | 0.6% |
| 3 | 51712 | 0.5% |
| 0 | 50853 | 0.5% |
| 5 | 30567 | 0.3% |
| 4 | 23747 | 0.2% |
| 7 | 12913 | 0.1% |
| 6 | 12822 | 0.1% |
| 9 | 11044 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 69933393 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 10140738 | ||
| A | 9127347 | |
| E | 6243142 | |
| D | 5530745 | 7.9% |
| I | 5234689 | 7.5% |
| S | 4842292 | 6.9% |
| O | 4121607 | 5.9% |
| U | 3498777 | 5.0% |
| R | 3367949 | 4.8% |
| N | 2997304 | 4.3% |
| Other values (27) | 14828803 |
| Distinct | 27005 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 2441370 |
| Missing (%) | 50.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3075005.845 |
| Minimum | 19 |
|---|---|
| Maximum | 9630546 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 36.7 MiB |
Quantile statistics
| Minimum | 19 |
|---|---|
| 5-th percentile | 24473 |
| Q1 | 2290278 |
| median | 2533979 |
| Q3 | 3182738 |
| 95-th percentile | 6874053 |
| Maximum | 9630546 |
| Range | 9630527 |
| Interquartile range (IQR) | 892460 |
Descriptive statistics
| Standard deviation | 1707394.582 |
|---|---|
| Coefficient of variation (CV) | 0.5552492151 |
| Kurtosis | 1.603776376 |
| Mean | 3075005.845 |
| Median Absolute Deviation (MAD) | 297072 |
| Skewness | 1.245833164 |
| Sum | 7.277388783 × 1012 |
| Variance | 2.915196258 × 1012 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 6310354 | 19031 | 0.4% |
| 3023435 | 4212 | 0.1% |
| 2361744 | 4014 | 0.1% |
| 2653966 | 2451 | 0.1% |
| 4026640 | 2044 | < 0.1% |
| 2511088 | 2021 | < 0.1% |
| 2482347 | 1783 | < 0.1% |
| 2316048 | 1775 | < 0.1% |
| 6859534 | 1773 | < 0.1% |
| 2482088 | 1710 | < 0.1% |
| Other values (26995) | 2325812 | |
| (Missing) | 2441370 |
| Value | Count | Frequency (%) |
| 19 | 33 | < 0.1% |
| 35 | 10 | < 0.1% |
| 43 | 386 | |
| 51 | 400 | |
| 86 | 18 | < 0.1% |
| 108 | 127 | < 0.1% |
| 116 | 120 | < 0.1% |
| 124 | 214 | |
| 132 | 181 | |
| 140 | 156 | < 0.1% |
| Value | Count | Frequency (%) |
| 9630546 | 3 | |
| 9618694 | 2 | |
| 9614990 | 1 | < 0.1% |
| 9614745 | 1 | < 0.1% |
| 9598405 | 1 | < 0.1% |
| 9597603 | 1 | < 0.1% |
| 9590560 | 1 | < 0.1% |
| 9575065 | 2 | |
| 9573550 | 3 | |
| 9572511 | 1 | < 0.1% |
| Distinct | 3699 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 3031030 |
| Missing (%) | 63.0% |
| Memory size | 36.7 MiB |
| CRAS CENTRO DE REFERENCIA DE ASSISTENCIA SOCIAL | 39099 |
|---|---|
| CRAS CENTRO | 34910 |
| CRAS CENTRO DE REFERENCIA DA ASSISTENCIA SOCIAL | 26266 |
| CRAS | 16197 |
| CENTRO DE REFERENCIA DE ASSISTENCIA SOCIAL | 14565 |
| Other values (3694) |
Length
| Max length | 70 |
|---|---|
| Median length | 62 |
| Mean length | 22.31553277 |
| Min length | 4 |
Characters and Unicode
| Total characters | 39653943 |
|---|---|
| Distinct characters | 38 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 186 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | CRAS DE SERRA SEDE |
|---|---|
| 2nd row | CRAS VIANA |
| 3rd row | CRAS IV ALTO MUCURI |
| 4th row | CRAS III CAMPO VERDE |
| 5th row | CRAS DE VILA NOVA DE COLARES |
Common Values
| Value | Count | Frequency (%) |
| CRAS CENTRO DE REFERENCIA DE ASSISTENCIA SOCIAL | 39099 | 0.8% |
| CRAS CENTRO | 34910 | 0.7% |
| CRAS CENTRO DE REFERENCIA DA ASSISTENCIA SOCIAL | 26266 | 0.5% |
| CRAS | 16197 | 0.3% |
| CENTRO DE REFERENCIA DE ASSISTENCIA SOCIAL | 14565 | 0.3% |
| CRAS I | 14545 | 0.3% |
| CRAS CENTRAL | 14418 | 0.3% |
| CENTRO DE REFERENCIA DA ASSISTENCIA SOCIAL | 13481 | 0.3% |
| CRAS CASA DA FAMILIA | 10952 | 0.2% |
| CRAS GRAJAU | 10554 | 0.2% |
| Other values (3689) | 1581979 | |
| (Missing) | 3031030 |
Length
| Value | Count | Frequency (%) |
| cras | 1688249 | |
| de | 459637 | 6.9% |
| centro | 269556 | 4.0% |
| social | 215685 | 3.2% |
| referencia | 214237 | 3.2% |
| assistencia | 210105 | 3.1% |
| da | 170611 | 2.6% |
| sao | 76547 | 1.1% |
| casa | 71732 | 1.1% |
| i | 69431 | 1.0% |
| Other values (3147) | 3224333 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 6010432 | |
| 4893157 | ||
| R | 4004524 | |
| S | 3703775 | |
| C | 3349080 | |
| E | 3208257 | |
| I | 2885801 | |
| O | 2200027 | 5.5% |
| N | 1718131 | 4.3% |
| D | 1413487 | 3.6% |
| Other values (28) | 6267272 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 34691229 | |
| Space Separator | 4893157 | 12.3% |
| Decimal Number | 69048 | 0.2% |
| Connector Punctuation | 509 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 6010432 | |
| R | 4004524 | |
| S | 3703775 | |
| C | 3349080 | |
| E | 3208257 | |
| I | 2885801 | |
| O | 2200027 | 6.3% |
| N | 1718131 | 5.0% |
| D | 1413487 | 4.1% |
| T | 1229859 | 3.5% |
| Other values (16) | 4967856 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 20193 | |
| 1 | 17123 | |
| 2 | 9704 | |
| 3 | 7508 | 10.9% |
| 4 | 7444 | 10.8% |
| 7 | 1996 | 2.9% |
| 8 | 1887 | 2.7% |
| 6 | 1376 | 2.0% |
| 5 | 1091 | 1.6% |
| 9 | 726 | 1.1% |
Space Separator
| Value | Count | Frequency (%) |
| 4893157 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 509 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 34691229 | |
| Common | 4962714 | 12.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 6010432 | |
| R | 4004524 | |
| S | 3703775 | |
| C | 3349080 | |
| E | 3208257 | |
| I | 2885801 | |
| O | 2200027 | 6.3% |
| N | 1718131 | 5.0% |
| D | 1413487 | 4.1% |
| T | 1229859 | 3.5% |
| Other values (16) | 4967856 |
Common
| Value | Count | Frequency (%) |
| 4893157 | ||
| 0 | 20193 | 0.4% |
| 1 | 17123 | 0.3% |
| 2 | 9704 | 0.2% |
| 3 | 7508 | 0.2% |
| 4 | 7444 | 0.1% |
| 7 | 1996 | < 0.1% |
| 8 | 1887 | < 0.1% |
| 6 | 1376 | < 0.1% |
| 5 | 1091 | < 0.1% |
| Other values (2) | 1235 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 39653943 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 6010432 | |
| 4893157 | ||
| R | 4004524 | |
| S | 3703775 | |
| C | 3349080 | |
| E | 3208257 | |
| I | 2885801 | |
| O | 2200027 | 5.5% |
| N | 1718131 | 4.3% |
| D | 1413487 | 3.6% |
| Other values (28) | 6267272 |
cod_centro_assist_fam
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSING| Distinct | 5639 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 3031030 |
| Missing (%) | 63.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.16930602 × 1010 |
| Minimum | 1.10001204 × 1010 |
|---|---|
| Maximum | 5.300109833 × 1010 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 36.7 MiB |
Quantile statistics
| Minimum | 1.10001204 × 1010 |
|---|---|
| 5-th percentile | 1.40010365 × 1010 |
| Q1 | 2.510800679 × 1010 |
| median | 3.304550065 × 1010 |
| Q3 | 3.550303288 × 1010 |
| 95-th percentile | 5.006600139 × 1010 |
| Maximum | 5.300109833 × 1010 |
| Range | 4.200097794 × 1010 |
| Interquartile range (IQR) | 1.039502609 × 1010 |
Descriptive statistics
| Standard deviation | 9529194109 |
|---|---|
| Coefficient of variation (CV) | 0.3006713158 |
| Kurtosis | -0.155210341 |
| Mean | 3.16930602 × 1010 |
| Median Absolute Deviation (MAD) | 5981499146 |
| Skewness | -0.04535792694 |
| Sum | 5.63174904 × 1016 |
| Variance | 9.080554037 × 1019 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3.550303288 × 1010 | 10554 | 0.2% |
| 3.550300162 × 1010 | 10447 | 0.2% |
| 3.550300165 × 1010 | 8915 | 0.2% |
| 3.550303289 × 1010 | 7656 | 0.2% |
| 2.30370012 × 1010 | 7220 | 0.2% |
| 3.550300177 × 1010 | 6999 | 0.1% |
| 3.550300167 × 1010 | 6998 | 0.1% |
| 3.55030018 × 1010 | 6291 | 0.1% |
| 3.550300168 × 1010 | 6144 | 0.1% |
| 3.550300163 × 1010 | 6000 | 0.1% |
| Other values (5629) | 1699742 | |
| (Missing) | 3031030 |
| Value | Count | Frequency (%) |
| 1.10001204 × 1010 | 357 | |
| 1.100020668 × 1010 | 350 | < 0.1% |
| 1.100051504 × 1010 | 56 | < 0.1% |
| 1.100061099 × 1010 | 26 | < 0.1% |
| 1.100070427 × 1010 | 98 | < 0.1% |
| 1.100081528 × 1010 | 267 | < 0.1% |
| 1.100092022 × 1010 | 12 | < 0.1% |
| 1.100111019 × 1010 | 887 | |
| 1.100112041 × 1010 | 617 | |
| 1.10012039 × 1010 | 66 | < 0.1% |
| Value | Count | Frequency (%) |
| 5.300109833 × 1010 | 27 | < 0.1% |
| 5.300109755 × 1010 | 14 | < 0.1% |
| 5.300109754 × 1010 | 4 | < 0.1% |
| 5.30010967 × 1010 | 2 | < 0.1% |
| 5.30010373 × 1010 | 406 | < 0.1% |
| 5.300103611 × 1010 | 193 | < 0.1% |
| 5.300103566 × 1010 | 1250 | |
| 5.300103514 × 1010 | 332 | < 0.1% |
| 5.300103512 × 1010 | 383 | < 0.1% |
| 5.300102047 × 1010 | 359 | < 0.1% |
| Distinct | 13 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 155334 |
| Missing (%) | 3.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 19.30041168 |
| Minimum | 0 |
|---|---|
| Maximum | 306 |
| Zeros | 4244511 |
| Zeros (%) | 88.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 36.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 205 |
| Maximum | 306 |
| Range | 306 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 63.21474913 |
|---|---|
| Coefficient of variation (CV) | 3.275305739 |
| Kurtosis | 8.191045388 |
| Mean | 19.30041168 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.102432982 |
| Sum | 89798292 |
| Variance | 3996.104508 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 4244511 | |
| 205 | 263094 | 5.5% |
| 202 | 43216 | 0.9% |
| 301 | 25218 | 0.5% |
| 204 | 24160 | 0.5% |
| 306 | 22809 | 0.5% |
| 303 | 10342 | 0.2% |
| 201 | 8732 | 0.2% |
| 305 | 4535 | 0.1% |
| 304 | 2462 | 0.1% |
| Other values (3) | 3583 | 0.1% |
| (Missing) | 155334 | 3.2% |
| Value | Count | Frequency (%) |
| 0 | 4244511 | |
| 101 | 1810 | < 0.1% |
| 201 | 8732 | 0.2% |
| 202 | 43216 | 0.9% |
| 203 | 1041 | < 0.1% |
| 204 | 24160 | 0.5% |
| 205 | 263094 | 5.5% |
| 301 | 25218 | 0.5% |
| 302 | 732 | < 0.1% |
| 303 | 10342 | 0.2% |
| Value | Count | Frequency (%) |
| 306 | 22809 | 0.5% |
| 305 | 4535 | 0.1% |
| 304 | 2462 | 0.1% |
| 303 | 10342 | 0.2% |
| 302 | 732 | < 0.1% |
| 301 | 25218 | 0.5% |
| 205 | 263094 | |
| 204 | 24160 | 0.5% |
| 203 | 1041 | < 0.1% |
| 202 | 43216 | 0.9% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 36.7 MiB |
| 1 | |
|---|---|
| 0 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 4807996 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 1 |
| 3rd row | 0 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 2424434 | |
| 0 | 2383562 |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 1 | 2424434 | |
| 0 | 2383562 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 2424434 | |
| 0 | 2383562 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4807996 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2424434 | |
| 0 | 2383562 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4807996 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 2424434 | |
| 0 | 2383562 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4807996 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 2424434 | |
| 0 | 2383562 |
qtde_pessoas
Real number (ℝ≥0)
| Distinct | 19 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.687830855 |
| Minimum | 1 |
|---|---|
| Maximum | 31 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 36.7 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 2 |
| Q3 | 4 |
| 95-th percentile | 5 |
| Maximum | 31 |
| Range | 30 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.442161182 |
|---|---|
| Coefficient of variation (CV) | 0.53655206 |
| Kurtosis | 1.471999579 |
| Mean | 2.687830855 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.9751546479 |
| Sum | 12923080 |
| Variance | 2.079828875 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 1300664 | |
| 3 | 1155777 | |
| 1 | 1122134 | |
| 4 | 726300 | |
| 5 | 309943 | 6.4% |
| 6 | 120289 | 2.5% |
| 7 | 44831 | 0.9% |
| 8 | 17188 | 0.4% |
| 9 | 6752 | 0.1% |
| 10 | 2600 | 0.1% |
| Other values (9) | 1518 | < 0.1% |
| Value | Count | Frequency (%) |
| 1 | 1122134 | |
| 2 | 1300664 | |
| 3 | 1155777 | |
| 4 | 726300 | |
| 5 | 309943 | 6.4% |
| 6 | 120289 | 2.5% |
| 7 | 44831 | 0.9% |
| 8 | 17188 | 0.4% |
| 9 | 6752 | 0.1% |
| 10 | 2600 | 0.1% |
| Value | Count | Frequency (%) |
| 31 | 1 | < 0.1% |
| 18 | 3 | < 0.1% |
| 17 | 3 | < 0.1% |
| 16 | 5 | < 0.1% |
| 15 | 18 | < 0.1% |
| 14 | 38 | < 0.1% |
| 13 | 122 | < 0.1% |
| 12 | 337 | < 0.1% |
| 11 | 991 | < 0.1% |
| 10 | 2600 |
peso.fam
Real number (ℝ≥0)
| Distinct | 886 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.21170859 × 1014 |
| Minimum | 5.501656235 × 1012 |
|---|---|
| Maximum | 5.504777045 × 1014 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 36.7 MiB |
Quantile statistics
| Minimum | 5.501656235 × 1012 |
|---|---|
| 5-th percentile | 5.502917229 × 1013 |
| Q1 | 5.502215892 × 1014 |
| median | 5.502451463 × 1014 |
| Q3 | 5.502540234 × 1014 |
| 95-th percentile | 5.503448472 × 1014 |
| Maximum | 5.504777045 × 1014 |
| Range | 5.449760482 × 1014 |
| Interquartile range (IQR) | 3.243417885 × 1010 |
Descriptive statistics
| Standard deviation | 1.170883957 × 1014 |
|---|---|
| Coefficient of variation (CV) | 0.2246641263 |
| Kurtosis | 12.33229392 |
| Mean | 5.21170859 × 1014 |
| Median Absolute Deviation (MAD) | 1.243471773 × 1010 |
| Skewness | -3.783180996 |
| Sum | -2.969788772 × 1018 |
| Variance | 1.370969241 × 1028 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5.502451463 × 1014 | 839807 | 17.5% |
| 5.502443092 × 1014 | 233230 | 4.9% |
| 5.50245607 × 1014 | 121686 | 2.5% |
| 5.50243164 × 1014 | 98360 | 2.0% |
| 5.502477921 × 1014 | 73396 | 1.5% |
| 5.502458232 × 1014 | 66634 | 1.4% |
| 5.502484072 × 1013 | 60027 | 1.2% |
| 5.502427962 × 1014 | 50397 | 1.0% |
| 5.502457847 × 1014 | 49254 | 1.0% |
| 5.502430102 × 1014 | 45467 | 0.9% |
| Other values (876) | 3169738 |
| Value | Count | Frequency (%) |
| 5.501656235 × 1012 | 2427 | 0.1% |
| 5.501682233 × 1012 | 1464 | < 0.1% |
| 5.5018315 × 1012 | 1769 | < 0.1% |
| 5.502526708 × 1012 | 15369 | |
| 5.50304564 × 1012 | 508 | < 0.1% |
| 5.503155193 × 1012 | 1941 | < 0.1% |
| 5.503181124 × 1012 | 3271 | 0.1% |
| 5.50442385 × 1012 | 1180 | < 0.1% |
| 5.500455455 × 1013 | 1312 | < 0.1% |
| 5.500467336 × 1013 | 1028 | < 0.1% |
| Value | Count | Frequency (%) |
| 5.504777045 × 1014 | 6 | < 0.1% |
| 5.504449465 × 1014 | 1337 | |
| 5.504443495 × 1014 | 1082 | |
| 5.504410327 × 1014 | 983 | < 0.1% |
| 5.504395098 × 1014 | 1186 | |
| 5.504391327 × 1014 | 1053 | |
| 5.504389448 × 1014 | 1292 | |
| 5.50438383 × 1014 | 1104 | |
| 5.504381965 × 1014 | 2465 | |
| 5.504380104 × 1014 | 1177 |
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
| cd_ibge | estrato | classf | id_familia | dat_cadastramento_fam | dat_alteracao_fam | vlr_renda_media_fam | dat_atualizacao_familia | cod_local_domic_fam | cod_especie_domic_fam | qtd_comodos_domic_fam | qtd_comodos_dormitorio_fam | cod_material_piso_fam | cod_material_domic_fam | cod_agua_canalizada_fam | cod_abaste_agua_domic_fam | cod_banheiro_domic_fam | cod_escoa_sanitario_domic_fam | cod_destino_lixo_domic_fam | cod_iluminacao_domic_fam | cod_calcamento_domic_fam | cod_familia_indigena_fam | ind_familia_quilombola_fam | nom_estab_assist_saude_fam | cod_eas_fam | nom_centro_assist_fam | cod_centro_assist_fam | ind_parc_mds_fam | marc_pbf | qtde_pessoas | peso.fam | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 3205002 | 2 | 2 | 1.0 | 2018-06-28 | 2018-10-02 | 244.0 | 2018-06-28 | 1.0 | 1.0 | 5.0 | 2.0 | 5.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 2.0 | 1.0 | 1.0 | 2.0 | 2.0 | NaN | NaN | CRAS DE SERRA SEDE | 3.205003e+10 | 0.0 | 0 | 5 | 550256458545518 |
| 1 | 3205101 | 2 | 2 | 3.0 | 2018-08-27 | 2018-11-29 | 60.0 | 2018-11-29 | 1.0 | 1.0 | 5.0 | 2.0 | 5.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 2.0 | 2.0 | NaN | NaN | CRAS VIANA | 3.205103e+10 | 0.0 | 1 | 5 | 550355704647837 |
| 2 | 3201308 | 2 | 2 | 4.0 | 2018-02-23 | 2018-02-27 | 937.0 | 2018-02-23 | 1.0 | 1.0 | 4.0 | 1.0 | 2.0 | 2.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 3.0 | 2.0 | 2.0 | NaN | NaN | CRAS IV ALTO MUCURI | 3.201300e+10 | 0.0 | 0 | 1 | 550259704488172 |
| 3 | 3201308 | 2 | 2 | 6.0 | 2013-12-27 | 2018-10-01 | 44.0 | 2017-06-22 | 1.0 | 1.0 | 4.0 | 1.0 | 2.0 | 2.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 2.0 | 3.0 | 2.0 | 2.0 | US CAMPO VERDE | 2652994.0 | CRAS III CAMPO VERDE | 3.201300e+10 | 0.0 | 1 | 2 | 550259704488172 |
| 4 | 3205002 | 2 | 2 | 7.0 | 2018-03-26 | 2018-03-28 | 0.0 | 2018-03-26 | 1.0 | 1.0 | 4.0 | 1.0 | 5.0 | 1.0 | 2.0 | 4.0 | 1.0 | 5.0 | 3.0 | 1.0 | 3.0 | 2.0 | 2.0 | UNIDADE REGIONAL DE SAUDE SERRA | 2465795.0 | NaN | NaN | 0.0 | 1 | 2 | 550256458545518 |
| 5 | 3205002 | 2 | 2 | 8.0 | 2016-10-27 | 2018-10-01 | 176.0 | 2016-10-27 | 1.0 | 1.0 | 6.0 | 3.0 | 5.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 2.0 | 2.0 | 2.0 | UNIDADE BASICA DE SAUDE VILA NOVA DE COLARES | 2522845.0 | CRAS DE VILA NOVA DE COLARES | 3.205000e+10 | 0.0 | 1 | 5 | 550256458545518 |
| 6 | 3205200 | 2 | 2 | 9.0 | 2015-06-16 | 2018-10-01 | 312.0 | 2018-03-20 | 1.0 | 1.0 | 5.0 | 2.0 | 5.0 | 1.0 | 1.0 | 1.0 | 1.0 | 3.0 | 1.0 | 1.0 | 3.0 | 2.0 | 2.0 | UNIDADE DE SAUDE DA FAMILIA DE ULISSES GUIMARAES | 3346501.0 | CRAS JABAETE | 3.205202e+10 | 0.0 | 0 | 3 | 550245146328323 |
| 7 | 3201308 | 2 | 2 | 10.0 | 2017-04-05 | 2018-10-01 | 954.0 | 2018-07-04 | 1.0 | 1.0 | 1.0 | 1.0 | 5.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 2.0 | 1.0 | 2.0 | 2.0 | NaN | NaN | CRAS VII SOTELANDIA | 3.201304e+10 | 0.0 | 0 | 1 | 550259704488172 |
| 8 | 3205200 | 2 | 2 | 11.0 | 2018-10-03 | 2018-10-15 | 477.0 | 2018-10-03 | 1.0 | 1.0 | 5.0 | 2.0 | 5.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 2.0 | 2.0 | UNIDADE DE SAUDE DA FAMILIA DE TERRA VERMELHA | 2403412.0 | CRAS MORADA DA BARRA | 3.205200e+10 | 0.0 | 0 | 2 | 550245146328323 |
| 9 | 3205002 | 2 | 2 | 12.0 | 2016-05-11 | 2016-05-11 | 4.0 | 2016-05-11 | 1.0 | 1.0 | 1.0 | 1.0 | 5.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 2.0 | 2.0 | NaN | NaN | NaN | NaN | 0.0 | 1 | 3 | 550256458545518 |
Last rows
| cd_ibge | estrato | classf | id_familia | dat_cadastramento_fam | dat_alteracao_fam | vlr_renda_media_fam | dat_atualizacao_familia | cod_local_domic_fam | cod_especie_domic_fam | qtd_comodos_domic_fam | qtd_comodos_dormitorio_fam | cod_material_piso_fam | cod_material_domic_fam | cod_agua_canalizada_fam | cod_abaste_agua_domic_fam | cod_banheiro_domic_fam | cod_escoa_sanitario_domic_fam | cod_destino_lixo_domic_fam | cod_iluminacao_domic_fam | cod_calcamento_domic_fam | cod_familia_indigena_fam | ind_familia_quilombola_fam | nom_estab_assist_saude_fam | cod_eas_fam | nom_centro_assist_fam | cod_centro_assist_fam | ind_parc_mds_fam | marc_pbf | qtde_pessoas | peso.fam | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 4807986 | 3550308 | 2 | 1 | 5290692.0 | 2018-11-22 | 2018-11-22 | 1400.0 | 2018-11-22 | 1.0 | 1.0 | 2.0 | 1.0 | 5.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 2.0 | 2.0 | NaN | NaN | CRAS JACANA | 3.550300e+10 | 0.0 | 0 | 1 | 550244309203512 |
| 4807987 | 3550308 | 2 | 1 | 5290693.0 | 2018-01-10 | 2018-10-01 | 468.0 | 2018-01-10 | 1.0 | 1.0 | 4.0 | 1.0 | 5.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 2.0 | 2.0 | UBS J VISTA ALEGRE | 2787946.0 | CRAS BRASILANDIA II | 3.550304e+10 | 0.0 | 0 | 2 | 550244309203512 |
| 4807988 | 3550308 | 2 | 1 | 5290694.0 | 2015-02-12 | 2018-10-15 | 30.0 | 2018-10-15 | 1.0 | 1.0 | 5.0 | 2.0 | 5.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 2.0 | 2.0 | NaN | NaN | CRAS ITAIM PAULISTA II | 3.550304e+10 | 0.0 | 1 | 5 | 550244309203512 |
| 4807989 | 3550308 | 2 | 1 | 5290695.0 | 2014-12-03 | 2018-10-01 | 436.0 | 2017-11-21 | 1.0 | 1.0 | 3.0 | 1.0 | 5.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 2.0 | 1.0 | 2.0 | 2.0 | NaN | NaN | CRAS ARTHUR ALVIM | 3.550303e+10 | 0.0 | 0 | 3 | 550244309203512 |
| 4807990 | 3550308 | 2 | 1 | 5290696.0 | 2017-09-13 | 2018-10-24 | 217.0 | 2018-10-24 | 1.0 | 1.0 | 3.0 | 1.0 | 5.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 2.0 | 2.0 | AMA UBS INTEGRADA JARDIM HELENA | 4049934.0 | CRAS SAO MIGUEL | 3.550300e+10 | 0.0 | 0 | 3 | 550244309203512 |
| 4807991 | 3550308 | 2 | 1 | 5290697.0 | 2018-07-30 | 2018-10-02 | 1129.0 | 2018-07-30 | 1.0 | 1.0 | 2.0 | 1.0 | 5.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 2.0 | 2.0 | NaN | NaN | CRAS CIDADE LIDER | 3.550303e+10 | 0.0 | 0 | 1 | 550244309203512 |
| 4807992 | 3550308 | 2 | 1 | 5290698.0 | 2018-02-16 | 2018-10-01 | 0.0 | 2018-02-16 | 1.0 | 1.0 | 8.0 | 5.0 | 5.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 2.0 | 2.0 | UBS INTEGRAL JARDIM MIRIAM II | 7128940.0 | CRAS CIDADE ADEMAR II | 3.550304e+10 | 0.0 | 1 | 1 | 550244309203512 |
| 4807993 | 3550308 | 2 | 1 | 5290699.0 | 2014-10-09 | 2018-10-01 | 162.0 | 2017-10-04 | 1.0 | 1.0 | 3.0 | 1.0 | 5.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 2.0 | 2.0 | NaN | NaN | CRAS JABAQUARA | 3.550300e+10 | 0.0 | 0 | 4 | 550244309203512 |
| 4807994 | 3550308 | 2 | 1 | 5290700.0 | 2006-05-24 | 2018-09-30 | 83.0 | 2017-08-15 | 1.0 | 1.0 | 3.0 | 1.0 | 5.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 3.0 | 1.0 | 2.0 | 2.0 | UBS J NAKAMURA | 2787644.0 | CRAS M BOI MIRIM | 3.550300e+10 | 0.0 | 1 | 1 | 550244309203512 |
| 4807995 | 3550308 | 2 | 1 | 5290701.0 | 2015-05-14 | 2018-10-01 | 445.0 | 2017-08-02 | 1.0 | 1.0 | 4.0 | 2.0 | 5.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 2.0 | 2.0 | UBS VARGINHA | 2789299.0 | CRAS GRAJAU | 3.550303e+10 | 0.0 | 0 | 3 | 550244309203512 |